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System and method for inferring geological classes 

FIELD OP THE INVENTION 

This invention relates the enhancements of neural network- 
assisted reservoir characterization technicpies for geological 
classification from measured input data. 

According to the present invention, the terms "measured input 
data" or "INPUT DATA" refers to, in particular, downhole logs. 
The set of logs used in the testing of the method of the 
invention includes gamma ray <GR) , sonic slowness (DT> , thermal 
neutron porosity (NPHI), bulk density (RHOB) and true 
15 resistivity (RT) , all measured at same depth for each sample, 
and at a constant sampling distance. However, INPUT DATA are not 
restricted to samples at a single depth. Alternatively, 
attributes that represent, for example, sliding window averages 
or other statistics taken over a depth range in the neighborhood 
of the depth of interest, can be constructed. 2D image logs 
(e.g., FMI) or 3D seismic cubes are also encompassed. 



According to the present invention, the terms "geological 
classes" or "CLASSES" refers to, principally, the rock facies 
(lithofacies) or the reservoir rock types. However, any other 
discrete classification of geological features (e.g. 
petrophysical properties) is possible. 



30 



PRIOR ART 



Rock facies class prediction by neural network processors 
applied to downhble logs is an existing method developed in the 
nineteen nineties which gave rise to several publications [1] - 
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For instance, it has been implemented by an ENI AGIP E&P team, 
and integrated in a joint development project into the product 



For rock facies estimation, a set of single-channel log curves 
are selected. Typical logs used are gamma ray (GR) , sonic 
slowness (DT) r thermal neutron porosity (NPHI) , and bulk density 



10 generated from existing logs in order to reveal additional 
features in the logs, 

A current limitation in analyzing geological measured data such 
as downhole logs, is that their relationship to classes such as 

15 rock facies is not obvious. In each borehole, there are unknown 
local factors that may affect the data in unexpected ways. It 
can thus be risky to classify on a simplified theoretical 
analysis or by data clustering. There is a need for a method to 
identify associations between input data and to build implicit 

20 complex functional relationships. A u learn from examples" method 
is more preferred to building an expert system. The discovered 
metbods would then be used to predict the classes and their 
associated probabilities. 

25 An Artificial Neural Network (ANN) scheme has been developed to 
implement learning by example as applied to downhole geological 
classification, Neural networks can "learn" specific computation 
schemes. Once trained, a neural network can find acceptable 
solutions on any set of data referring to tbe learned schemes. 

30 This gives artificial neural networks an ability to generalize 
from training experience Csee [12]). Unlike analytical 
approaches such as statistics , neural networks require no 
explicit computational model, and are not limited by a lack of 



Rockcell™ within the Schluniberger 1 * GeoFram&™> oilfield 

interpretation software platform. 



(RHOB), but this list is not limited. New attributes can also be 
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normality or the non- linearity of the physical phenomenon. As a 
consequence, they "learn" relationships between data that may be 
hard to discover with analytical methods. 



5 The behavior of a neural network is defined by its architecture * 
This architecture consists of the way its neurons (individual 
computing elements) are connected and by strength (weight) of 
those connections. Each neuron performs a weighted sum (linear 
combination) of its inputs, then applies an almost non- linear 

10 activation function, to finally produce an output. The resulting 
output of a given neural layer is forwarded to * the next layer 
and so on through the" network. In other words, neural networks 
plainly perform a massively parallel set of elementary 
computations. , Whereas the weights vary the strength of 

15 connections from one node to another, the sigmoidal activation 
function provides . the highly non-linear property of neural data 
processing. 



The main advantage of those neural nets is their learning 
20 capability . During the learning phase, given a training set of 
data, the interconnection weights are gradually adjusted so as 
to stabilize the network's output, - and, in the case of the 
supervised learning, to minimize the mean square error between 
the effective output and the desired one. The preferred 
25 implementation of the KIN is a supervised -f eed-f orward, multi- 
layer perceptrons trained with the back-propagation algorithm. 



Methods and techniques used today are able to classify without 
the a priori knowledge of classes sequencing. The . prediction 
30 operates on geological input" data sample-by- sample, and produces 
for each input pattern the probabilities of the most likely 
classes . 
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However, this system sometimes fails in its predictions. One of 
its main limitations is that it does not honour geological prior 
knowledge. Some of the predictions fail due to the fact that 
geologically improbable classes transitions are often observed- 



Sedimentologists have observed that the vertical and lateral 
sequence of geological facies 1 seen in outcrop and in the 
subsurface are not random. Since the stratigraphic layering in 
the earth represents successive time of deposition , the rock 

10 record actually represents a time series of events. Since the 
normal neural net techniques make sample-by- sample predictions, 
they do not consider previous states of prediction (e.g., the 
facies predicted at location X^i, which implies t n -i, constrains 
the prediction at location X) and they fail to take advantage of 

15 likely non-random transitions between lithology or facies. 
Geology cian provide strong constraints on the prediction of 
stratigraphic successions- Sedimentologists have long invoked 
Markov models for analyzing the vertical and lateral sequences 
[2, 3, 4, 5]. Therefore, using a Markov scheme using geological 

20 prior information of rock facies transition probabilities seems 
a fruitful way to improve the prediction of the neural network 
scheme. 

Systems for speech recognition, integrating a neural network and 
25 a Hidden Markov Model (HMM) , are known from the state of the 
art. HMMs are used as a major approach in the majority of 
continuous speech recognition systems. They provide an accurate 
and reliable framework for segmentation and classification of 
speech. HMM states can stand for the phone classes, c± (e.g., 
30 phonemes) to be identified, whereas the HMM observation sequence 
for the acoustic vectors y (e.g., a combination of cepstral and 
energy acoustic parameters). As a consequence, the state 
sequence X •= jcj, x^, . . . , of length T can be considered as 
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the "Bentence" to be recognized due to the' recorded and 
discretized acoustic observation sequence 3T - y z , Y2> •■-> 

Facies sequences have been considered as analogous to the 
5 phoneme sequences in the speech recognition methods. The HMM and 
its stochastic behavior represent the allowed or forbidden 
transitions between geological classes and their associated 
probabilities, and the geological input data are analogous to 
the acoustic observation vectors used during the speech 
10 recognition process. 



The HMM technology has already been applied to lithofacies 
classification from well logs. Publications [6], 17] and [8] 
describe the building, training and application of a Hidden 
15 Markov Model to estimate the lithology of uncored boreholes 
based on key learning data sets where the lithology is known. In 
those methods, the lithofacies sequence stands for the 
consecutive states of the HMM, and the log data for the 
observations. Those methods do not rely on the use of a neural 
20 network. This means they are able to model the stochastic, 
character of rock facies transitions and the rock facies 
sequences. However, they perform poorly while modeling the non- 
linear relationship between logs and rock facies, as they do not 
benefit from the complex neural network architectures and 
25 computation schemes. 

In the papers [ft] and [10], and in several patents concerning 
speech recognition, such as [11], an interesting approach to 
classify speech phonemes has been developed by the use of hybrid 
30 models mixing both HMM and ANN. Those approaches enable speech 
recognition systems to cope with the strong statistical 
assumptions of the HMMs. 
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Applying a feed-forward neural network to the input data y can 
give us estimates of the conditional posterior probabilities 
p{x±jy) of each class x±, given the current input vector y. * 
Those class-conditional posterior probabilities must sum to one, 
5 and therefore need to be normalized. However, a HMM needs the 
conditional prior probabilities p(yfx±>. Assuming there are 
enough training data and that the training does not get held up 
in poorly performing local minima, the feed-forward neural 
network is able to approximate the prior probabilities thanks to 

10 Bayes' rule. Indeed, p(yjxi) - p(x±jy) x p(y> / p(x±) , The prior 
probability distribution of classes is context-dependent but can 
be estimated by counting the classes occurence of classes in the 
learning set, or by introducting prior knowledge. The prior 
probability of the observation vector can be discarded as for 

15 each time step; it is independent of the phone class. 

The HMM and observation sequence finally provide, thanks to the 
Viterbi algorithm, the most likely state sequence which caused 
the observed acoustic data sequence. 
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SUMMARY OF THE INVENTION 

The starting point of this invention consists of enhancing the 
5 neural networks algorithms to make their predictions more 
accurate and robust in oilfield applications. 

In a first aspect, the invention concerns a system for inferring 
geological classes from oilfield well input data comprising a 
10 neural network for inferring class probabilities, characterized 
in that said system further comprises means for integrating 
class sequencing knowledge and optimising said class 
probabilities according to said sequencing knowledge. 

15 Preferably, the means for integrating class sequencing knowledge 
and optimising said class probabilities according to said 
sequencing knowledge comprises a hidden Markov model. 

in a second aspect, the invention concerns a method for 
20 inferring geological classes from oilfiled well input data, 
comprising the following steps: - inferring class probabilities 
with a neural network; and - integrating class sequencing 
knowledge and optimising said class probabilities according to 
said sequencing knowledge. 

Preferably, integrating class sequencing knowledge and 
optimising said class probabilities according to said sequencing 
knowledge is achieved according to a hidden Markov model. 

Advantageously, the invention relates to a system and method for 
inferring geological classes from single-channel oilfield input 
data by applying hybrid neural network hidden Markov models 
classifiers. 



25 



30 
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The geological classification ia inferred using supervised 
neural networks that are applied to the input data and that 
predict the associated classes. The vertical class transition 
constraints are learned within a Markov class transition table 
5 and a prior class distribution, which are then reused during the 
estimation of the classes. This optimizes the predicted class 
curve and honours geological prior knowledge. 
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This invention relates the enhancements of artificial neural 
10 network (AMW) reservoir characterization techniques for 
geological classification. Supervised neural network classifiers 
can be applied to downhole logs to automatically predict 
lithology or other classes in boreholes* However, ANN systems 
sometimes infer geologically incorrect vertical (stratigraphic) 
15 class transitions within a borehole. The root cause of these 
errors is the fact the networks analyze and predict the output 
classes sample-by- sample, without taking the whole borehole 
sequence of classes into account* Improving the prediction of 
lithofacies from downhole logs is solved by the system and 
20 method of the present invention. In essence, they do not take 
into account -local information that is commonly important in 
stratigraphic rock sequences. Geological transitions are 
commonly not random, but predictably sequenced. 

25 Tiie system outlined here integrates an a priori knowledge of 
class sequencing and of class probability distribution in the 
neural network predictor, xt consists in combining a supervised 
back-propagation, feed-forward neural network architecture with 
a Hidden Markov Model module into a complex hybrid neural 
processing chain. The second processing step optimizes the class 
stratigraphic sequence. Instead of simply choosing, for each set 
of input data the class that is the most probable, the chosen 
class is the one which has both a reasonable occurrence 
probability given the input data pattern and . a reasonable 
35 occurrence probability given the previous estimated class. Such 



30 
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a choice governed both by class transition and a posterior class 
observation probability is implemented through the Viterbi 
algorithm . 



5 These and other features of the invention, preferred embodiments 
and variants thereof, possible applications and advantages will 
become appreciated and understood by those skilled in the art 
from the following detailed description and . drawings . 

10 DRAWINGS 



Figure 1 is a block-diagram of the training of the hybrid 
ANN-HMM classification system. The training set consists of 
INPUT DATA across several wells and associated core information. 
15 The normalization of INPUT DATA and the generation of additional 
attritutes is optional (dotted arrows) - The construction of the 
HMM during the training phase is optional as well, if it is not 
essential to compute it for the training data set. 

Figure 2 is a block-diagram of the estimation of the hybrid 
20 ANN-HMM lithofacies classification system 011 uncoxed boreholes 
by applying the system to well logs. The normalization of INPUT 
DATA and the generation of additional attritutes is optional 
(dotted arrows) . As for the HMM, one can load an existing HMM 
from the data storage system and / or manually define it on the 
25 basis of the geological prior knowledge. 

Figure 3 is a block diagram of the Hybrid ANN-HMM 
processing chain in the geological classification estimation 
mode. The supervised, neural network module aims to predict the 
posterior class, probabilities given an observation. The Viterbi 
30 processing optimizes the predicted class path. 

Figure 4 depicts the same process as in figure 3, but with 
a state- to-state HMM instead of a sample-by-sample one. 
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Figure 5a shows a particular ANN architecture where the 
neural network integrates a Kalman-trained matrix K. 

Figure 5b illustrates the concept of neural network expert 
committee. 



5 



MODE(S) FOR CARRYING OUT THE INVENTION 



The Hybrid ANN / HMM is composed of two different components, 
which are the ANN posterior CLASS probability estimator, and the 
10 HMM, comprising only a CLASS transition table .and a CLASS 
probability distribution. Those components are trained 
separately during the training phase of the system, as they do 
not need to interfere with one another during the learning step. 
They are also applied separately during the estimation step. 

15 

1. Data choice and input 

Processing of the INPUT DATA is done on a sample-by- sample 
basis, and therefore the CLASS probabilities are estimated for 
each sample . 

20 

1.1. Borehole choice (see step 1.1 to 1.3 on figure 1 and 
2.2 on figure 2) 

Both the learning and the estimation of the Hybrid HMM / ANN 
25 classification system can be done on several wells, as long as 
they share the same geological INPUT DATA and properties. This 
system - is by consequence designed to propagate the knowledge of 
the physical and statistical relationships between INPUT DATA 
and CLASSES, as measured in one or several wells, to the whole 
30 set of boreholes within an oilfield. 
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If one or more INPUT DATA curves are missing, they can be 
estimated thanks to available data. One can for instance 
integrate synthetic logs so as to perforin rock class estimation. 



figure 2) 

The following section is applicable if we use log curves as 
INPUT DATA. For the purposes of testing and validating the 

10 method outlined here, the set of logs used includes gamma ray 
(GR) , sonic slowness (DT) , thermal neutron porosity (NPHI) , and 
bulk density (RHOB) , all measured at same depth for each sample, 
and at a constant sampling distance. A regression is often done 
on those data, which means that ah each depth of interest we 

15 take some samples above and some samples below that current 
depth - 

However, any ID, 2D or 3D information is suitable for this 
system* A first solution is to extract ID depth-oriented 
20 attributes from the existing information. Another one is to 
extract sliding window statistics at the neighborhood of the 
depth of interest, and this can be done for instance on 2D FMI 
images or 3D seismic cubes, 

25 1.3. Training data set and cross-validation data set {see 
step 1.3 on figure 1> 

The learning data set must have both INPUT DATA and core or 
geologist-defined corresponding CLASSES zonation. This .CLASSES 
30 zonation of the INPUT DATA is considered as the desired goal 
which has to be attained by the Hybrid HMM / ANN classification 
system. 



1.2. 



Input data choice (see step 1.1 on figure 1 and 2.1 on 
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The supervised training of the ANN component is done for each 
sample of the INPUT? DATA, and for as many epochs as necessary/ 
until a global mean square error between the desired outputs and 
the actual outputs is satisfactory. A second error, the so- 
called cross-validation error, is also computed on a different 
data set, not taken into account when training the ANN. This 
monitors the generalizing abilities of the ANN, preventing the 
ANN from learning the training data set xt by heart". Usually, the 
training stops when the cross-validation error starts to 
increase. 

The separation between training set and cross-validation set is 
done on the basis of random choice. The total percentage of data 
15 being selected for the cross-validation set is chosen during the 
ANN architecture choice step, for instance p 50%. Then, for 
each sample of the INPUT DATA set, that sample is randomly 
attributed to the training set or the cross-validation set 
according to the probability p. 

20 

1.4. Additional attributes generation (see step 1.4 on 
figure 1 and 2.3 on figure 2) (OPTIONAL) 



5 



10 



As the INPUT DATA may not contain, on a localized sample-by- 
25 sample basis, enough spatial information which could help to 
discriminate among CLASSES, additional information may be 
extracted from existing data, e.g. seismic data or logs. That 
information can for instance, show the evolution of the INPUT 
DATA curves, the. energy contained in the curves or the smoothed 
30 low- frequency component of the curves. In order to obtain such 
information one can apply a set of gradients to the INPUT DATA 
curves, or extract low frequencies thanks to the Past Fourier 
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Transform, or approximate the INPUT DATA curves with Polynomial 
curves on small windows. 

This additional attributes generation is done both for the 
5 -learning data set and for the estimation data set. 

1.5. Log data normalization (see step 1.5 on figure 1 and 



10 In order to enhance the generalization performance of the ANN, a 
pre-processing step consisting of normalizing of the data can be 
applied. It can consist of one or more of the following 
computations : Mean - Standard Deviation Normalization, 
Principal Component Analysis (with retention of the principal 

15 components which account for 95 or 98 % of the data) , Mininum - 
Maximum Normalization- Note that this list is not exhaustive, 

2 . Neural network component 

2 0 Several architectures, training algorithms, and methods of 
implementation are possible for the neural network. The 
component is a feed- forward MLP (Multi-Layer Perceptron) , with 
an input layer (one neuron per log data attribute) , one or 
several hidden layers, and an output layer (one neuron per 

25 CLASS) . The outputs O = (o x , o 2 , ~ o N ) of the ANN have to be the 
probabilities . of each CLASS, according to the current INPUT 
DATA, and as a consequence have to equal 1 and each belong to 
the interval [0, 1] . 



2.4 on figure 2) (OPTIONAL) 
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2.1. Choice of the neural network architecture (see step 
1.6 on figure 1 and figures 5a, 5b) 



A preferred embodiment of this method is the three-layered 
5 neural network, with: sigmoid activation functions; bias; as 
many nodes on the first layer as there are log attribute inputs , 
for instance 4, then 10 nodes on the first hidden layer and 10 
nodes on the second hidden layer . The number of nodes on the 
output layer is the same the number of CLASSES. 

10 

An additional linear matrix K is added to the ANN after the 
output layer of the neural network. In this case, the last 

neural layer does not need to have as many nodes as there are 
CLASSES, but the linear matrix K has to be correctly sized and 
15 performing the following operation z X = K Xh, where X is a 
vector of size JV (N being the number of CLASSES) , Xh is a vector 
of size Nh coming out from the last neural layer, and K is a 

matrix of size Itfxffii. 

20 No matter which ANN architecture is retained, the ANN modules 
can be combined into a neural network expert committee as shown 
on figure 5b, step 5b,2. 



2.2. Training the ANN (see step 1.7 to 1.10 on figure 1) 

25 

2.2.1. Evaluation of the network performances at each 
step (see step 1.8 on figure 1) 

Evaluation is realized by computing the global RMSE (Residual 
30 Mean Square Error) between the desired outputs as provided by 
the training or the cross-validation data set, and the actual 
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outputs of the ANN* Two curves corresponding to that training 
and cross-validation error are displayed and monitored during 
the training process. 



2,2.2. Error Back- Propagation (see step 1.7 on figure 1) 



The supervised training of the neural network is performed by 
the Error Back- Propagat ion , and the algorithm used can be, for 
instance, Gradient Descent with Adaptative Learning Rate and 

10 Momentum. This means that the difference between the expected 
CLASS probability as provided by the training data set, and the 
actual current output of the ANN, is propagated backwards 
through the ANN and the neural weights are accordingly updated. 
The Adaptative Learning Rate means that this correction is 

15 proportional to a learning rate which is tuned accordingly to 
the evolution of the global RMSE. The Momentum means that a term 
corresponding to the- total sum of the neural weights of the 
network is added to that global error, with the aim of avoiding 
the values of those weights increasing too much. 

20 

2.2.3. ANN committees (see figure 5b) 



Instead of one ANN, one can run several ANN and average their 
outputs (see step 5b. 3}. The training of each ANN module of that 

25 committee is done thanks to a bootstrap procedure (see step 
5b. 1), which consists of slightly altering the training set for 
each ANN (different partition of INPUT DATA samples between the 
training - set and the cross-validation set) , and randomly 
initializing the neural weights of the ANN before training. The 

30 generalization abilities of the ANN are then enhanced. 
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2,2.4. Training of the K matrix by Kalman filtering (see 
figure 5a) 

This JC matrix is trained in the following way: 

5 o At each epoch of the training, a first run of the ANN 
through all the INPUT DATA samples is realized, 
o As a result a matrix Mb is computed. Each row xh of that 
matrix corresponds to the outputs of the last layer of the 
ANN, for a given INPUT DATA sample. 
10 o The training data set is a matrix Mt where each row 
corresponds to the CLASS probabilities JCfc for a given INPUT 
DATA sample. 

o The matrix K is approximated by a Kalman-Filtering 
technique so as to minimize the RMSE of J5? « xt - JC Xh where 
15 E and Xt are vectors of size N {N being the number of 

CLASSES) . 

o Once the matrix K is approximated, the Back- Propagation is 
applied to the ANN for all the training data set samples, 
and the error is propagated through K first before being 
20 propagated through the network. 

2*2.5. Termination of the training (see step 1.9 and 
1-10 on figure 1} 



25 The end-user chooses on how many training epochs he wants the NN 
- to be trained. The training might stop earlier if the cross- 
validation error has begun to increase. If the final 
performances of the ANN are not satisfactory, the end-user can 
tune the ANN parameters and try a different configuration. One 

3 0 can also propose a system where several configurations of the 
ANN are successively automatically tried and the best one 
retained. 
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3 . The Hidden Markov Model component 



3.1. Training of the HMM 



3.1.1. Automatic training on the cored / geologist 
defined lithofacies data (see step 1.11 on figure 1) 



The CLASS transition probability table depends only on the 
10 CLASSES of the learning set and it is therefore an absolute and 
static reference. It is computed by counting the successive 
CLASS transitions. It cannot be influenced by the neural 
predictions, and for a proper application of the viterbi 
algorithm, it should be learnt on a large training set of facies 
15 log curves. It is possible to learn the CLASS transition table 
on a set of multiple wells, and in this case the CLASS 
transitions between two different wells are obviously not taken 
into account in the computation of the CLASS transition table. 



20 These CLASS transitions can be counted on a sample-by-sample 
basis, (i.e. for each INPUT DATA sample) , or on a state-to-state 
basis, grouping all the samples from the same CLASS together 
( see figure 4 ) . 

25 A similar automatic computation is done to approximate the CLASS 
probability distribution. 

3.1.2. Geologist-driven correction of the HMM model (see 
step 1.12 on figure 1, 2.5 on figure 2, and 3.1 on 
30 figure 3) 
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This correction is performed based on the geologist's expert 
prior knowledge and can be done after the automatic estimation 
of the HMM on the learning data set, or before applying the HMM 
to a specific estimation data set of borehole logs. It relies on 
. 5 the CLASS transition probabilities, (and) the CLASS probability 
distribution ♦ In case of rock facies classification, it can also 
rely on the lithofacies bed thickness which the geologist has 
defined in his geological study, 

10 3.2. Combining HMM and ANN 

3.2.1. Applying Bayes' Rule (see step 3,2 to 3.4 on 
figure 3 and 2,6 to 2.7 on figure 2) 

15 The ANN posterior CLASS probabilities estimator is designed to 
work independently from the observation set. Actually, this is a 
plain classifier, providing the probability p(x±) for each 

INPUT DATA sample* However, this can also be expressed as the 
posterior CLASS probabilities for each input data pattern given 
20 the current observation, &(x±jy} . 

A HMM model requires three different elements, which are; the 
state transition probabilities matrix, the state probability 
distribution, and the observation probability matrix given the 
25 current state p(yjx±) . Those three elements are also required 

for the Viterbi algorithm described below* 

In order- to get p(yfjc±), one needs to apply the Bayes Rule and 
to introduce the observation (INPUT DATA) probability 
3 0 distribution and the state (CLASS) probability distribution. 
However, as the INPUT DATA observations are continuous, and as 
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for each depth step of the Viterbi algorithm that observation 
probability remains the same for all the possible CLASSES, one 
can just discard the p(y) value. 

5 3.2.2. Applying the Viterbi optimization algorithm {see 

step 3.5 on figure 3 and 2 . 8 on figure 2) 

At this stage, the hybrid ANN / HEM classification system has 
both a prior CLASS distribution, a CLASS transition 

10 probabilities matrix, and the posterior CLASS probabilities 
matrix, which depend on the time patterns and the observations. 
The viterbi algorithm can be applied to those data, provided 
that the application of Bayes' rule will transform the posterior 
CLASS probabilities into prior observation probabilities given 

15 the previous CLASS and the time pattern. 

Xt can be seen that there is no need to compute the markovian 
matrix of the observation probabilities given the CLASS. The 
Viterbi algorithm can therefore directly integrate the time- 

20 dependent observation probabilities given the current CLASS and 
the current time pattern. In other words, the state transition 
matrix and the state probability distribution have a static 
behavior (although they can be tuned to the context of the 
estimation) whereas the observation probabilities, given the 

25 previous CLASS, are depth-dependent. 

3.2.3. State-to-state or sample-by- sample classes 
transitions (see figure 4) 

30 In case of sample-by-sample classes transitions, the Viterbi 
algorithm is applied to all the INPUT DATA and associated 
estimated CLASSES probabilities samples. 
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In case of state- to-state transitions,- all the consecutive 
samples which have in common the same most probable CLASS (see 
step 4«1), are grouped together (see step 4.2) and considered as 
5 an unique observation element* The CLASS probabilities of all 
the samples belonging to that element are also averaged (see 
step 4,3), and the Viterbi algorithm is then applied to the 
groups of observations and not to each observation sample. 

10 The mean of computing the CLASS probabilities of the observation 
element can be either a plain mathematical average or a more 
complex average. 

In order to perform a state-to-state Viterbi optimization, the 
15 state transition probabilities and the state distribution have 
also to be computed on a state-to-state basis and not on a 
sample-by-sairtple basis (see 3*1.1 and step 4.4)". 

After the Viterbi optimization, the observation elements are 
20 split into INPUT DATA samples again, and the CLASS curve is 
displayed on a .sample-by-sample basis (see step 4.5) . 

4. Testing results 

25 The simple ANN and the Hybrid AltfN / HMM classifiers have been 
trained and tested on three different sets of geological data. 
As input data we have used downhole logs, and as classification 
results, the rock facies classes. 

30 4.1. Data set 1 (cored logs) 
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The first data set used for the training contained 490 samples 
of 4 logs (DT, GR, NPHI and RHOB) and associated core facies (7 
facies classes) , and were taken from real measurements performed 
in a well between the depth of 2975 - 3051 m. 

Once trained, the hybrid system has been tested on the 
measurements from the same well between depths 2923 - 3161 m,< 
which corresponded to 1562 samples from the same logs. 

Whereas a single ANN system showed an amount of approximately 40 
to 45 % correct predictions, the hybrid ANN / HMM system reached 
an accuracy of 45 to 55 %. 



15 



20 



25 



4.2. Data set 2 {cored logs) 

The data set used for the training contained 3800 samples of 5 
logs {DT, GR, NPHI, RT and RHOB) and associated core facies {13 
facies classes) , and were taken from real measurements performed 
in 4 wells between the depth of 8000 - 9000 feet. 

Once trained, the hybrid system has been tested on the 
measurements from 4 other wells of the same field, where core 
data were available. The accuracy of the results has been 
significantly increased, and tests are still currently being 
performed to assess the effect of several types of additional 
log attributes. 



4.3. Data set 3 {non-cored logs) 
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In this example, the training data set contained about 2000 
samples from one well. The testing' data set contained between 
1500 and 2000 samples per well in a field of 5 wells. 



5 The lithofacies learning set for the Hybrid ANN / HMM system was 
provided by the results of electrof acies predictions of an 
unsupervised neural network classifier • The stability of the 
predictions of the . latter systems has then been compared with 
the predictions of a plain ANN trained on the same lithofacies 
10 log curve. 

The Hybrid ANN / HMM system is more reliable than a single ANN 
system in the terms of prediction accuracy and gave less noisy 
results; it will therefore provide better geological lithofacies 
15 log curve estimations • 
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CLAIMS 

1. A system for inferring geological classes from oilfield 
well input data comprising a neural network for inferring 
5 class probabilities, characterized in that said system 

further comprises means for integrating class sequencing 
knowledge and optimising said class probabilities according 
to said sequencing knowledge. 

10 2. The system of claim 1, wherein the means for integrating 
class sequencing knowledge and optimising said class 
probabilities according to said sequencing knowledge 
comprises a hidden Markov model. 

15 3. A method for inferring geological classes from oilfiled 
well input data, comprising the following steps: 

inferring class probabilities with a neural network; 

and 

integrating class sequencing knowledge and optimising 
20 said class probabilities according to said sequencing 

knowledge . 

4. The method of claim 3, wherein the integrating class 
sequencing knowledge and optimising said class 
25 probabilities according to said sequencing knowledge is 

achieved according to a hidden Markov model. 
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ABSTRACT 



The invention relates to a system for inferring geological 
classes from oilfield well input data comprising a neural 
network for inferring class probabilities. According to the 
invention, the system further comprises means for integrating 
class sequencing knowledge and optimising said class 
probabilities according to said sequencing knowledge. 
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Fig. 3 
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